DCX:UKENT RESUME 



ED 333 889 



IR 053 633 



AUTHOR 
TITLE 

INSTITUTION 

SPONS AGENCY 
PUB DATE 
CONTRACT 
NOTE 

PUB TYPE 

EDRS PRICE 
DESCRIPTORS 



IDENTIFIERS 



Wallace, Danny P.; And Others 

Age Analysis of Public Library Collections. Final 
Report . 

Louisiana State Univ., Baton Rouge. School of Library 
and Information Scienoe. 
Department of Education, Washington, DC. 
90 

R039A90006 
135p. 

Reports - Research/Technical (143) 
MF01/PC06 Plus Postage. 

-Decision Making; "Evaluation Criteria; Library 
Automation; Library Circulation; Library Collection 
Development; -Library Collections; Obsolescence; 
•Public Libraries; Statistical Analysis 
*Aging (Library Materials) ; -Weeding (Library) 



ABSTRACT 

The use of information regarding the ages of library 
items is a standard component of many approaches to weeding library 
collections, and has a long history in the literature of collection 
management. Current and past approaches to using aging information to 
make weeding decisions make use of very arbitrary decision criteria. 
This study examined the actual aging patterns of public library 
materials as a precursor to the production of a set of collection 
management procedures based on sophisticated analyses of aging data. 
The operational objectives of the study were (1) to develop a set of 
profiles of the aging characteristics of public library collections, 
based on analysis of a body of diverse public library holdings data; 
(2) to formulate a set of collection management guidelines based on 
the aging profiles, with detailed descriptions of appropriate 
analyses and their interpretation; and (3) to design an easy to use, 
flexible computer progreun to aid in the application of the 
guidelines. This report covers the problem statement and objectives 
of the study; the background and context of the study; approaches to 
weeding library collections; methodology; sample; data collected; 
data analysis; sample application of age data tc rollection analysis 
(Profile 1— the 90% retention rate and Profi]e 2— curve examination); 
comparison of the percentage and curve examination methods; 
continuation of the project; and dissemination of project results. 
Thirteen notes/references are included, and 109 figures are appended. 
(BEN) 
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Abstract 



The use of information regardini^ the ages of library items is a standard component 
of many approaches to weeding hbrary collections, and has a long history in the literature 
of collection management. Current and past approaches to using aging information to 
make weeding decisions make use of very arbitrary decision criteria. Ttiis study examined 
the actual aging patterns of public librar>' materials as a precursor to production of a set of 
collection management procedures based on sophisticated analyses of aging data. 
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AGE ANALYSIS OF PUBLIC LIBRARY COLLECTIONS 

FIN/J. REPORT. USDE GRANT #R039A90006 

Problem Statement and Objectives 

The planned removal of obsolete materials from public library collections is essen- 
tial to the maintenance of quality library service. Although many strategies lor identifying 
obsolete items have been published, none of them is bjised on an empirical examination of 
the aging characteristics of library collections. Selection of items for "weeding," or 
removal from the active collection, is an ongoing problem in public library operations. 
Library patrons generally need and desire items that are most up-to-daie. This is par- 
ticularly true of works in certain subject areas, such as science, technology, political 
science, and medicine, but is also true of all areas of the librf.jys collection. Unused items 
generate clutter on the shelves, and may have a seriously detrimental effect on the patron's 
assessment of the library and its collection. Space is generally at a premium in public 
libraries, and unused and unusable item:> compete for space with high-use, highly useful 
items. 

Current decision making regarding removal of items from public library collections 
focuses on items that have deteriorated physically, or makes us'i of arbitrary criteria for 
evaluating the use of materials or the age at which an item should be removed from the 
collection. The general objective of the project described here was to develop a more 
precise, scientific basis for making collection management decisions based on the aging of 
public library materials. 
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The project described here should be of specific benefit to public libraries and their 
patrons. The project makes use of data gathered from automated systems to provide ac- 
curate data regarding the aging of library collections, and to develop analytical tools for 
identifying aging patterns for library materials. Although most automated circulation sys- 
tems are cur/enlly capable of providing management reports pertaining to the ages of cir- 
culated and, sometimes, uncirculated materials, they are not currently capable of providing 
detailed analyses of such aging data. At the same time, librarians are generally not familiar 
with the basic methods for systematically analyzing the aging of library collections, and are 
therefore unable to develop effective, sophisticated strategies for utilizing the reports gen- 
erated by their circulation systems. 

The specific operational objectives of the study were 

1. to develop a set of profiles of the aging characteristics of public library col- 
lections, based on analysis of a body of diverse public library holdings data; 

2. to formulate a set of collection management guidelines based on the aging 
profiles, with detailed descriptions of appropriate analyses and their intei- 
pretation; 

3. to design an easy to use, flexible computer program to aid in the application 
of the guidelines. 

The objec.ives encompassed a combination of pure and applied research goals. At a pure 
research level, the aging patterns of public library collections are of interest simply because 
ihey are an unknown. From an applied, action oriented direction, the project was very 
deliberately aimed at the development uf collection management procedures based on age 
analysis. Success in achieving the project's objectives is the focus of this report. 
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The Background and Context or the Study 



Most experienced librarians are well aware that items in their collection tend to at- 
tract less use as they age. In many libraries this factor is used as a rationale for discarding 
older items, and the process of determining the point ai which an item has become so old 
thai it should be discarded is an important collection management issue. This aging 
process has also been observed in the context of the references included in scholarly publi- 
cations: most references tend to be to relatively recent publications, and the likelihood of 
a publication being cited appears to decline over time. This aging process is generally 
referred to under the bibliometric category of "obsolescence," although the aging of infor- 
mation sources and the obsolescence of a technolog)' or methodology clearly are :i0t 
directly analogous. When a piece of machinery' is said to be obsolescent, there is usually 
the implication that it has been replaced by a better piece of machinery and is therefore no 
longer of use. Obsolescence in bibliometrics, however, suggests only that older materials 
are not used, not that they are no longer useful. 

Obsolescence has usually been studied in the context of the circulation of items in a 
library collection, or of the citation of one body of literature by another. The results of ob- 
solescence studies are quite consistent: when items are ranked according to their age at 
the time they are used (circulated, requested, cited, etc.), recent items account for a very 
large proportion of the items used, while very old items receive very little use. The prin- 
ciples underlying the study of obsolescence are also of substantial potential use in library 
collection management: 

If documents are being considered, the interest is probably a practical one in the 
probability that an item will be required, as a guide to such questions as when to dis- 
card older volumes, how long to keep new ones, what sort of retrospective storage 
and access an information retrieval system should provide, and so on. 
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The relationship between obsolescence and use has not been adequately explored. Al- 
though use of the collection is an important goal for any library, age data may well con- 
stitute an important independent factor in collection analysis. The idea that older 
materials may legitimately be either discarded or relegated to some remote storage area is 
familiar to all librarians. The problem of employing strategies for discarding or moving to 
secondary storage lies in determining when an item is old enough to be removed from the 
primary collection, and decisions are frequently made on the basis of ad hoc rules of thumb 
or vague guesses. Determination of the actual patterns with which use of the collection 
declines over time can help make it possible to make more informed decisions and reduce 
the potential to. making incorrect decisions. The use of systematic obsolescence studies in 
collection management is at present hampered by the difficulty of gathering appropriate 
data. The Age Analysis of Public Library Collections project explored the ways in which 
machine-readable data can be used in analyzing the obsolescence of public library collec- 
tions. 



Approaches to Weeding Librarj' Collections 



Weeding library collections has a number of potential benefits, including effective 
utilization of space, efficient employment of staff and patron time, increasing the appeal of 
the library's collection, enhancing the reliability of the collection, monitoring the physical 
condition of materials in the collection, and providing a basis on which to evaluate the 
strengths and weakness of the collection.^ According to Lancaster, "weeding is necessary 
because the value of an item to a collection changes over time."^ It has also been sug- 
gested that effective weeding has a positive effect on circulation^ although no conclusive 
study of the relationship between weeding and circulation has yet been presented.^ 




Final Report, USDE Grant #RC39A90006, page 4 



Traditional approaches to weeding concentrate on evaluation of the physical ap- 
pearance of library materials, elimination of unnecessary duplication, evaluation of the 
content or format of materials, use, and aging.^ 

Evaluation based on physical or content considerations requires actual examination 
of the material under consideration. Systematic examination of an entire library's collec- 
tion is an overly time-consuming activity, and the number of items actually identified for 
removal may not justify such expenditure of time. Furthermore, these approaches require 
a substantial level of judgment. Physical deterioration alone is rarely an appropriate 
criterion for weeding, since such deterioration may accompany items that are old and rare, 
inherently fragile items that have been damaged by poor storage and/or handling, or high 
use items that need to be repaired or replaced rather than weeded. Evaluation of the con- 
tent of a work is always complex, and requires an intellectual judgment on the part of a 
skilled evaluator. Because evaluation of content is time-consuming and uncertain, it is of 
limited use as a primary means for identifying items for weeding. 

Use of library materials is a consideration that has long played an important part in 
collection management. Use is generally measured by circulation, and the fundamental as- 
sumption of use studies is that items that are not used should not be retained. The most 
common approach involves examining circulation for some fixed period of time, and tar- 
geting items that did not circulate during that tirne for removal from the collection.^ Al- 
though it impossible to argue against the principle that library materials are acquired and 
maintained with their use as a primary objective, there are several problems with noncir- 
culation as a primary criterion for weeding. Tht most obvious problems relate to identify- 
ing an appropriate sampling period for gathering circulation data and the balance between 
circulation and in-house use. The greatest limitation of use analysis is that circulation 
counts provide no information at all regarding the reasons why items are or are not used. 
Obsolete items may circulate, while nonobsolete items may not. In some cases, the ab- 
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sence of current materials may force library patrons to check out material that is out of 
date. Under other circumstances, the perception that a section of a collection contains 
mostly old material may discourage circulation of items in the section that arc not ob- 
solete. 

Current weeding methods based on age generally make use of rather arbitrary 
criteria for different classes of items. A classic discussion of this approach was McClcllan's 
methodology, which combined age categorizations with physical condition and use 
considerations.*^ A typical scheme offers suggestions such as discarding reference works 
after ten years or when they are superseded, maintaining works in religion and philosophy 
virtually forever, providing "frequent revision" for the social 5.ciences, and maintaining 
science and technology materials for no more than ten years.^ These age categories ap- 
pear to have been derived from the collective personal experiences of librarians, and there 
is no evidence of an empirical basis for them. There appears to be general agreement that 
libraries should discard old materials, but there is no systematic approach to defining 
"old." The project described here was designed to provide librarians with a sophisticated 
set cf procedures for determining what proportion of a collection can be considered old, 
and for identifying materials so old that they are likely candidates for weeding. 

In a more systematic approach, some set of characteristics of the collection itself 
would be used to select "target" items. The sample of target items would then be collected 
in one place, presumably by a page or other nonprofessional employee, and examined by a 
librarian or higher level nonprofessional to determine what action should be taken. The 
actions to be taken could include 

a. removal from the collection (the classic sense of the term "weeding"), 

b. conversion to an alternative forma: such as microreproduction r: an 
electronic medium, 

c. repair or replacement, 

d. enhancement to encourage use, 
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e. relegation to some secondary storage site, 

f. transfer to another library wnere use or usefulness might be higher, or 

g. retention of the item as is. 
Information that could contribute to the decision includes 

a. circulation or other use history, 

b. physical condition of the item, 

c. appraisal of the item's intrinsic value, 

d. evaluation of the item's value to the collection, 

e. language of the item, 

f. number of copies of the item held in the collection, 

g. availability of the item elsewhere, 

h. availability of the item in other formats, and 

i. index or bibliography coverage of the item. 

Although this process of examination can be done for a collection ^. whole, the time re- 
quired for doing so is often intimidating. A reasonable estimate of an efficient weeding 
process might require an average of twenty minutes of professional level time for each item 
examined. For the smallest library in the study database, with a total collection size of 
14,849 items, spending twenty minutes per item on the examination process would require 
4,950 person/hours, equivalent to more than two years of of professional level effort. For 
the largest library in the database, with a collection of 98,085 iicms, more than forty-seven 
years of effort would be required. This does not account for delays due to misshelving, 
items in circulation at the time sought for examination, or other delays in locating items. It 
also does not include the time required for assembling the materials prior to examination. 
Spending less time per item will accelerate the process and reduce person/hours but will 
inevitably reduce the quality of the decisions made. Although some libraries conduct 
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regular, complete inventories for weeding and other purposes, a substantial disruption of 
normal services is generally involved. Tor larger libraries, any inventory approach is simply 
overwhelming. 



Mcthodolog)' 

Data gathering and data manipulation are the major problems of a study of the 
aging characteristics of public library collections. Although gathering data directly from lo- 
cal automated systems for a sample of libraries might be desirable, such an approach 
presents problems of data compilation and comparability of data formats that would un- 
doubtedly be substantial. As a substitute for data frorr, local systems, this study used data 
from the OCLC (Online Computer Library Center) union catalog. The original plan of 
work entailed the following steps: 

1. Demographic data from the American Library Directory and the Public 
Library Data Ser\>ice Statistical Report were to be used to develop profiles for 
ten demographically diverse public libraries. Libraries would be selected on 
the basis of geographic location, size of collection, population of community 
and other demographic characteristics. The objective for giving considera- 
tion to demographic data was to identify librariec representative of a cross- 
section of U. S. public libraries. The desired demographic profile called for 
four libraries with collections of fewer than 20,000 items, three libraries with 
collections of between 20,000 and 70,000 items, and three libraries with col- 
lections of more than 70,000 items. Two libraries were to be located in the 
Northeast, three in the Midwest, three in the South, and two in the Wesi, 
Two wCi C 10 represen» nonmctropolitan areas, three to represent central city 
metropolitan areas, and five to represent suburban metropolitan areas. 
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2. OCLC, Inc. would match the ten profiles to ten public libraries represented 
in the OCLC online union catalog, and provide the investigators with a tape 
copy of the complete holdings data for each of the ten libraries. Data 
provided to the investigative team would not include identification of the ten 
libraries, thereby assuring confidentiality. 

3. The aging characteristics of the data would be analyzed as a prerequisite to 
development of a model for incorporating aging data into weeding decisions 
and design of specific techniques for doing so. The model was to be tested 
by the development of weeding profiles 

The original plan was modified in a number of ways, most of which arose from the 
selection of OCLC as a source of data. The demographic profiles originally developed 
proved to be too complex for easy matching to OCLCs database, and were therefore 
simplified to represent only rough size and geographic area considerations. Even then, 
drawing data for ten libraries was found to be a greater drain on OCLC resources than was 
anticipated, and a compromise sample of seven libraries was selected.^ 

Sample 

The nature of the sample of seven public libraries is summarized in Table 1. There 
were no very small libraries, because such libraries are not represented in the OCLC 
database. There were no very large libraries, due to the problem of selecting data from the 
database and delivering it in an acceptable format within a reasonable period of time; in- 
cluding larger libraries would have delayed the project further and probably would have 
exacerbated file conversion problems. In comparison to the desired demographic break- 
down, the sample has one fewer library in the under 20,000 holdings category and two 
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fewer in the more than 70,000 category. The libraries sampled can all be considered to be 
within the midrange of U. S. public libraries. All regions of the nation except the south 
were represented. No information on the communities served by the libraries is available, 
so the distribution among metropolitan, suburban and nonmetropolitan is unknown, since 
the identities of the libraries are not known to the researchers. 

TABLE 1: SAMPLE BREAKDOWTS' 



TOTAL USABLE 
LIBRARY REGION HOLDINGS HOLDINGS 



1 


Midwest 


14 


,849 


12,899 


2 


Midwest 


22 


,409 


20,828 


3 


Midwest 


98 


,085 


96,773 


4 


Northeast 


16 


,326 


14,892 


5 


Northeast 


43 


,260 


43,000 


6 


West 


18 


,224 


18,176 


7 


West 


37 


,190 


36,925 



TOTAL 250,343 243,493 



Data Collected 

The data provided by OCLC identified 250,343 publications held by the seven 
libraries. Missing data elements and other flaws in the sample data resulted in a usable 
database of 243,493 records. Each record consisted of the OCLC identification number for 
each item, a unique library identification number (a sequential number from one to seven), 
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the language of the publication, date information from the OCLC MARC fixed field, addi- 
tional date information from the $C subfield of field 260, and call number from either the 
082 or 092 field (with the 082 field preferred). These data can be summarized as follows. 

Language. Sixty-seven language codes were represented in the sample. English lan- 
guage publications were overwhelmingly predominant, accounting for 97.3% of the sample, 
including 99.4% of Dewey classified items and 90.5% of non-Dewey items. No other lan- 
guage accounted for as much as one percent of the sample. This paucity of foreign lan- 
guage obviates the utility of conducting a comparative analysis of the aging characteristics 
of publications based on language of publication. 

Date. Preference was arbitrarily given to the date from the OCLC fixed field. This 
differed from the $C/260 field date in only a very small number of cases. Some editing was 
required to eliminate obviously bogus dates, including dates in the distant future and dates 
represented by numbers of less than four digits. No attempt was made to correct these er- 
roneous dates; they were simply recorded as missing data. The oldest date recorded was 
for a non-Dewey item published in 1800; the oldest Dewey item was published in 1850. 
Date information was used to calculate ages of items, using 1990 as a base. Ages therefore 
ranged from zero years to 190 years, with a mean of 8.72. The mean for Dewey items was 
8.01 years, and the mean for non-Dewey items was 11.01 years. 

Call number. The sample was divided into two major groupings for purposes of 
analysis: Dewey and non-Dewey. Items with special additions to the call number field, 
such as "y^i" or "x", were consolidated with items not possessing these amendments. 
Dewey items were analyzed in depth. Non-Dewey items have as yet been analyzed only in 
a very simple manner due to the complexity of dealing with the variety of non-Dewey rep- 
resentations of the call number field. Counts of Dewey and non-Dewey items and the per- 
centage of Dewey items for each library are shown in Table 2. Dewey items accounted for 
between 73.8 and 85.1 percent of the collection, %'iih an aggregate percentage of 77.4 per- 
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cent for the database as a whole. This very high percentage of Dewey items may suggest 
that the libraries for A-hich data were analyzed have been more assiduous in reporting hold- 
ings for the classified portions of their collections than for fiction and other unclassified 
materials. The libraries are very similarly balanced in terms of where their holdings lie. 
Pearson correlations among the seven libraries by numbers of items in each Dewey class 
(broken down to the Dewey tens) yielded coefficients between .92 and .99, indicating that 
the balance of hOidings in each Dewey tens class was very similar for all the libraries in the 
sample. 

TABLE 2: BALANCE OF VEVsTY AND NON.DEV\T,Y HOLDINGS 

DEWEY NON- DEWEY DEWEY 
LIBRARY HOLDINGS HOLDINGS PERCENT 



1 


10,435 


2,464 


60.9% 


2 


17,724 


3,104 


65.1% 


3 


71,405 


25,368 


73.6% 


4 


11,397 


3,495 


76.5% 


5 


35,695 


7,105 


63.5% 


6 


13,773 


1,403 


75.6% 


7 


27,786 


9,139 


75.2% 


TOTAL 


188,415 


55,076 


77 .4% 



Data Analysis 

The major focus of the data analysis was the search for patterns within the relation- 
ship between call number ranges and ages of items. Analyses were conducted in a number 
of ways, including examination of all libraries as a group in addition to individual libraries, 
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all Dewey items as a group as well as several variations on Dewey ranges, and all non- 
Dewey items as a group. Non-Dewey items were not further subdivided, and all analysis 
thus far nas dealt with Dewey and non-Dewey items separately rather than in the ag- 
gregate. 

Patterns were examined by creating statistical profiles of variations relationships be- 
tween call number and age and by constructing graphic representations of the relation- 
ships. The mean, median and maximum age values for groupings by Dewey 100s for each 
library and for all libraries as a group are shown in Table 2. 

As can be seen from Table 2, there are fairly consistent but not especially great 
variations in the mean and median values. The mean age for all Dewey items ranges from 
4.875 years for Library 2 to 14.596 years for Librar>' 5.^^ The median age ranges from four 
to thirteen years. The mean age for all libraries ranges from 7.056 years for the Dewey 
800s to 10.745 for the Dewey 400s; the median ranges from six to eight years. The patterns 
for individual libraries vary somewhat, but overall the patterns for different libraries and 
different Dewey classes are remarkably similar. Although the differences in means among 
the libraries and among the Dewey classes are statistically significant (oneway analysis of 
variance reveals a probability level of .001 in both cases), it is hard to attach much impor- 
tance to differences that are numericiMy rather small. The differences among the mean 
ages are represented graphically in Figures 1 though 12. These graphs reiterate the consis- 
tency of the patterns among libraries and among Dewey classes. 
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TABLE 3 : MEAN. MEDIAN AND MAXIMUM VALUES 
DEWEY lOOs BY LIBRARIES 



DEWEY 



LIB 1 LIB 2 



LIB 3 LIB 4 LIB 5 LIB 6 LIB 7 ALL LIBS 



0009 

MEAN 

MEDIAN 

MAX 
1008 

MEAN 

MEDIAN 

MAX 
2008 

MEAN 

MEDIAN 

MAX 
3008 

MEAN 

MEDIAN 

MAX 
40011 

MEAN 

MEDIAN 

MAX 
5008 

MEAN 

ME.DI7VM 

MAX 
6008 

MEAN 

MEDIAN 

MAX 
7008 

MEAN 

MEDIAN 

MAX 
8008 

MEAN 

MEDIAN 

MAX 
9008 

MEAN 

MEDIAN 

MAX 



4.724 
4.000 
18.000 

5.171 
4.000 
36.000 

6.503 
4.000 
107 .000 

5.025 
4.000 
134.000 

6.793 
5.000 
64.000 

5.432 
4.000 
36.000 

5.672 
5.000 
84.000 

6.211 
5.000 
90.000 

4.959 

4 >000 
92.000 

6.749 

5.000 
118.000 



5.229 
5.000 
35.000 

5.686 
4.000 
52.000 

5.633 
5.000 
52.000 

4 671 
4.000 
89.000 

7.416 
6.000 
35.000 

5.925 
5.000 
35.000 

4.972 
4.000 
90.000 

5.516 
4.000 
90.000 

3.988 
4.000 
83.000 

5.163 
4.000 
109.000 



5.967 
6.000 
90.000 

6.730 
5.000 
52 .000 

7.344 

6.000 
52 .000 

5.556 
5.000 
90 .000 

9.240 

7 .000 
90 .000 

6.760 
6.000 
54.000 

6.011 
5.000 
90 .000 

6.768 

6.oon 

90.000 

5.836 
5.000 
90 .000 

6.224 
5.000 
103.000 



7.981 
7.000 
33.000 

11.068 
9.000 
45.000 

12.531 
9.000 
53.000 

9.226 
8.000 
118.000 

14.831 
12.000 
42.000 

10.669 
8.000 
64.000 

9.148 

8.000 
95.000 

12.668 
9,000 
86.000 

6.475 
5.000 
90.000 

13.489 
9.000 
128.000 



14.000 
13.000 
93.000 

14.631 

13.000 
80.000 

15.345 
13.000 
89.000 

14.050 

13^000 
81.000 

16.731 
14.000 
84.000 

15.402 
14.000 

72.000 

14.539 
13.000 
68.000 

14.792 
14.000 
94.000 

13.?1B 
13.000 
89.000 

15.856 
14.000 
140.000 



7.581 
7.000 
49.000 

8.628 
6.000 
40.000 

9.023 

7r000 

52.000 

7.039 
5.000 
81.000 

13.101 
9.000 
92.000 

8.855 
7 .000 
54.000 

8.135 
7.000 
60.000 

8.659 
7.000 
84.000 

6.370 
5.000 
90.000 

10.838 
7.000 
139.000 



6.278 
6.000 
25.000 

6.947 
6.000 
53.000 

6.726 
6.000 
41.000 

6. 105 
5.000 
53.000 

8.960 
7 .000 
103.000 

6.777 
6.000 
52.000 

6.758 
6.000 
41.000 

9.451 
6.000 
95.000 

6.326 
5.000 
94.000 

6.381 
6.000 
60.000 



7.702 
7.000 
93.000 

8.579 
7.000 
80.000 

9. 123 
7.000 
107.000 

7.409 
6.000 
134.000 

10.745 
8.000 
103.000 

8.410 
7.000 
72.000 

8.014 
7.000 
95.000 

9.247 
8.000 
95.000 

7.0S6 
6.000 
94.000 

8.987 
7.000 
140.000 



ALL 
MEAN 
MEDIAN 
MAX 



5.495 
4.000 
134.000 



4.875 

4 . 000 
109.000 



6.145 
5.000 
103.000 



9.096 
7 .000 
128.000 



14.596 
13.000 
140.000 



7.998 
6.000 
139.000 



6.813 
6.000 
103.000 



8.012 
6.000 
140. 000 



Although the patterns of mean, median and maximum values are of interest, more 
meaningful information is gained by graphing the entire range of values for some combina- 
tion of call number range, library and age of item. It is conventional to construct graphs in 
a cumulative manner. Previous obsolescence studies have tended to reveal the pattern 
shown in Figure A.^^ This curve shows a very rapid rise representing a clustering of rela- 
tively young items, falling into a declining rate of cumulation representing the older por- 
tion of the collection. Bourne used this common bibliometric distribution to posit the 
"ninety percent library" as a decision making model for library practice. "^-^ In age analysis, 
the selection of some retention percentage can be used to define target items that will be 
candidates for some kind of weeding action. Those itemi. below the targeted percentage 
level can be assumed to be contributing members of the collection and will no: be con- 
sidered candidates for weeding, while those items above the targeted percentage level will 
be examined in some way to determine whether action shoulc. be taken to either remove 
them from the collection or enh.ance their likelihood of use. Figure A projects lines for the 
eightieth and ninetieth percentiles. In practice, the desired retention rate would be 
decided at the local level for an individual library, based on such considerations as 
availability of shelf space, the focus of collecuon management efforts, budget allocation*, 
circulation characteristics of the collection or detailed examination of the aging patterns of 
specific groups of materials in the collection. 

One of the striking characteristics of the body of data used for this study is the con- 
si.stency of patterns. If the entire database is examined, the pattern is very much in keeping 
with that of Figure A. This pattern is also seen when the collections of individual libraries 
are considered, as shown in Figure 109. Overall, the pattern shows a rapidly rising line that 
begins to fall into a flatter line at about seventeen to eighteen years. The same basic pat- 
terns prevail for individual Dewey call number ranges, particularly when grouped across all 
libraries. When call number gioups are graphed for individual libraries, the patterns are 
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somewhat less smooth, but still fall into the general curve of Figure A. The curve tends to 
be somewhat ragged for the Dewey 000s, 100s, 300s, and 500s but much smoother for the 
300s, 600s, 700s, 800s and 900s. 

The aging patterns for Dewey ranges (by hundreds) tend to show Library 5 as the 
oldest, but the anomaly noted above clearly influences this. Library 4 is consistently second 
oldest, followed by Library 6. Library 7 and Library 3 trade positions for fourth and fifth 
oldest, depending on call number rangfe. Libraries 1 and 2 trade places fer sixth and 
seventh oldest, with Library 2 most consistently the "newest" collection. This suggests that 
the libraries' collections do have different characteristics and histories, despite the overall 
similarity in aging patterns. 

The Dewey 400s were consistently the oldest class of materials, representing the 
greatest mean age for all libraries except Library 7. The Dewey 000s, 300s, and 800s were 
the youngest classes, although none of them was universally younger than the others. 

Sample Application of Age Data to Collection Analysis 

As was mentioned earlier, specific application of age analysis to collection manage- 
ment decision making must be based c i local objectives and the availability of resources 
for conducting the analysis. As an example of how age data might be incorporated into 
selecting target items for decision making, two approaches to targeting will be described. 
The first makes use of an arbitrary retention percentage of ninety percent; that is, the 
oldest ten percent of the collection will be examined. The second approach makes use of 
the point at which the curve for each Dewey major class (Dewey hundreds) makes the 
i msition from the dramatic rise to the the flatter tail of the distribution. Library 1 has ar- 
bitrarily been chosen for these examples; only the Dewey portion of the collection is in- 
cluded in the examples. 
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Profile 1: The Ninety Percent Retention Rate 



The first method for targeting items for examination consists of lookiiig at the 
oldest ten percent of items either across the board or in various classes (that is, selecting a 
ninety percent retention rate). It seems likely that examining items within individual 
classes will be more useful than an across the board approach, although the results should 
ultimately be very similar. For Library 1, with a Dewey collection of 10,435 items, the 
oldest ten percent comprises 1,044 items (all percentages are rounded upward to prov^'de 
for a slight oversampling). This would involve those items approximately ten years old or 
older. Table 4 shows the target samples that would be chosen if analysis were conducted at 
the level of the Dewey hundreds rather than across the board. Analysis at this level results 
in class age cutoffs from eight to twelve years, and a total target sample of 1,047 items, a 
miniscule oversampling in comparison to the across the board ten percent examination 
figure. Assuming that examining each item targeted would require an average of about 
twenty minutes of professional level staff lime, conducting an analysis based on ninety per- 
cent retention for Library 1 would require 349 person/hours, or almost nine weeks of ef- 
fort. 



Profile 2: Curve Examination 



Examining the transition points in the actual distributions produces substantially dif- 
ferent results, also shown in Table 4. The graph of the collection taken as a whole appears 
to make the transition at about 14 years, with a target sample of 629 items, only six percent 
of the population of 10,435 items. In most cases, the transition for the ten Dewey classes 
approximates the point at which there are fewer than ten items of each age; there is an ele- 
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ment of arbitrariness in this approach that allows for refinement in local decision making. 
The most obvious result of this approach as applied for this example is that tht total num- 
ber of items to be examined is substantially less. "When individual Dewey class distribu- 
tions were examined, 572 items were targeted, representing 5.5 percent of the Dewey 
population for Library 1. This would require about 191 person liours, not quite five weeks, 
for analysis. 



TABLE 4: COMPARISON OF METHODS FOR IDENTIFYING TARGET ITEMS 



DEWEY 


TOTAL 


90% 


AGE 


AGE 


GRAPHIC 


RANGE 


HOLDINGS 


SIMPLE 


CUTOFF 


CUTOFF 


SAMPLE 










FROM 












GRAPH 




000s 


308 


31 


8 


11 


17 


100s 


240 


24 


11 


7 


46 


200s 


169 


17 


10 


12 


14 


300s 


1827 


183 


8 


15 


62 


400s 


87 


9 


10 


IS 


5 


5008 


507 


51 


10 


12 


41 


600s 


1670 


167 


10 


16 


64 


700S 


1254 


1?6 


12 


17 


67 


600s 


3212 


322 


8 


17 


143 


9008 


1151 


117 


12 


13 


113 


TOTAL 


10425 


1047 


10 


14 


572 
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Comparison of the Percentage and Curve Examination Methods 



The major difference between the selection of a desired retention percentage and 
examination of the curve of the actual age distribution is the extent to which the latter 
method can be fine tuned to meet very specific decision making goals. If the objective is to 
examine as few items as possible while still maintaining a sound, systematic basis for target- 
ing items, the curve examination method, carefully applied, is very attractive. This 
flexibility is useful for accommodating goals related to collection balance and growth, since 
individual sections of the collection can be handled separately. Although the retention 
percentage approach c?n also be tailored to specific sections of the collection, determining 
discrete retention percentages for subsections of the collection may be a difficult process 
and will inherently require an element of truly arbitrary decision making. At the same 
time, a disadvantage of the curve examination method is that it is substantially less 
mechanical than the retention percentage approach, and therefore requires substantially 
more intellectual effort on the part of the professional level employee charged with iden- 
tifying the target sample. Determining an appropriate, meaningful and useful transition 
point requires developing a "feel" for the body of data being considered, and also requires 
being able to relate graphic data to the kind of tabular presentation produced by a spread- 
sheet. 



Continuation crthe Project 



Like most truly interesting research, the Age Analysis of Public Library Collections 
project cannot be considered truly complete. It is only the beginning of a long-term 
research process. The project has turned out to be substantially more ambitious than the 
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researchers' had realized at the outset. The wealth of the data provided by OCLC is 
phenomeral, considering its seeming simplicity. Describing and understanding the aging 
patterns of the data collected have occupied most of the period for which funding was 
provided, and will continue to be the focus of research efforts for some time. Only general 
recommendations for library pra'^tice can be made at this time; more specific procedures 
and rules for expediting them will need to follow further analysis of the data. The develop- 
ment of programs for computerized analysis, which was initially viewed as being one 
product of the study, has yet to be accomplished. The project, although it has revealed 
much of interest about the age characteristics of public library collections, must be con- 
sidered a work still in progress. The researchers plan to extend the research to include the 
following: 

1. further, more detailed analysis of the OCLC data, including comprehensive 
analysis of both the Dewey and non-Dewey records; 

2. development of specific recommendations and procedures for utilizing age 
information in making collection management decisions; 

3. application of the age analysis process to an individual library in a case study 
mode of operation; 

4. development of computer programs for age analysis. 

It seems evident, based on the analysis conducted up to this time, that studying an in- 
dividual library is a necessary next ste^ and is a prerequisite to any definitive statement 
regarding the efficacy of detailed age analysis as a collection management tool. This case 
study should ideally involve application of a variety of weeding methods, including the 
more traditional approaches already in general use as well as the retention percentage and 
curve examination methods described in this report. The data analyzed for this study seem 
to emphasize the attractiveness of detailed age analysis as a collection management tool. 
The development of an easy to use computer analysis program capable of working with ex- 
istinf^ local databases should be fairly simple, assuming that the consistency of the aging 
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patterns observed for the seven libraries studied is typical. There is no reason to believe 
that the data examined are not typical, at least of libraries in the size range of those 
studied. Although the objectives of the project have not been fully achieved, then, they 
certainly appear to be achievable, given further time, thought and effort. It is the fervent 
intent of the research team to apply such time, thought and effort and build from the foun- 
dation that has been laid to a more comprehensive understanding of the role of age 
analysis in evaluating library collections. A major benefit of the funded stage of the Age 
Analysis of Public Library Collections project has been the establishment of a solid base 
around which further research can be shaped. 



Dissemination of Project Results 



The work reported here has been disseminated in a number of public forums. 
Reports of the work in progress have been delivered to the Louisiana State Library Ad- 
ministrative Librarians' Conference (Baton Rouge, December 1990), the Louisiana Library 
Association Annual Conference (Monroe, March 1990), the Association for Library and 
Information Science Education Amiual Conference (Chicago, January 1990), and the 
American Library Association Annual Conference (Dallas, June 1989). The project 
formed the basis for two all-day workshops on "Effective Collection Weeding" given for 
the Northeast Texas Library System (McKinney and Quitman, October 1990). A talk on 
the project, its findings and their implications for library practice will be given at the 
March, 1991 National Conference of the Public Library Association. An article describing 
the focus of the work appeared in Public Libraries, the official journal of the Public Library 
Association.^*^ It is anticipated that s^'iveral other articles and presentations will be 
produced, including at least one article for a major library and information science 
research journal. Responses to the work have been frequent and positive. Many librarians 
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have expressed an interest in exploring how age analysis could effectively be used in their 
individual libraries, and several have volunteered to have their libraries serve as test sites 
for further study. The research team anticipates selecting one or more of these volunteer 
libraries as a case study test site for the next stage of the project. 
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